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Appl.No. 10/805,803 
Amdt. dated March 4, 2009 
Reply to Office Action of November 4, 2008 

Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 
Listing of Claims: 

Please amend claims 1, 10, 1 1, 13, 15, 16, 18, 19, 22, 28, and 30 as follows: 
1 . (currently amended) A processor with an instruction class controllable pipeline 
comprising: 

a program storage unit holding a diverse plurality of class one and class two 
executable function instructions, each of the class one instructions capable of being executed in a 
first pipeline, each stage of the first pipeline having a first execution latency and each of the class 
two instructions capable of being executed in a second pipeline, each stage of the second pipeline 
having a second execution latency, wherein the first execution latency is shorter than the second 
execution latency; 

an adaptable fetch stage for fetching an instruction from the program storage unit to 
be stored in an instruction register; 

an adaptable decode stage for classifying and decoding the instruction stored in the 
instruction register and generating an instruction class indication and storing the decoded 
instruction in a decode register; 
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an adaptable execution stage for execution of the decoded instruction stored in the 
decode register, the decoded instruction being a class one instruction or a class two instruction; 
and 

an adaptable pipeline control unit responsive to the instruction class indication for 
adapting *he -a duration of latency of the adaptable fetch stage, the adaptable decode stage, and the 
adaptable execution stage to the first execution latency for the class one instructions and to the 
second execution latency for the class two instructions dependent on the instruction class 
indication. 

2. (previously presented) The processor of claim 1 , wherein the adaptable fetch stage 
further comprises: 

a program counter and an instruction memory fetch mechanism which are operable 
to begin instruction processing by fetching one or more instructions from the program storage unit. 

3. (previously presented) The processor of claim 1 , wherein the executable function 
instructions comprise: 

additions, subtractions, multiplications, divisions, compares, ANDs, ORs, 
ExclusiveORs, NOTs, shifts, rotates, permutes, bit operations, moves, loads, stores, 
communications or combinations thereof. 

4. (previously presented) The processor of claim 1 , wherein the adaptable pipeline 
control unit further comprises: 

a pipeline control mechanism for class one instructions to execute in the first 
execution latency; and 
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a pipeline control mechanism for class two instructions to execute in the second 
execution latency. 

5. (previously presented) The processor of claim 4 wherein the pipeline control 
mechanism for class one instructions further comprises: 

control for normal pipeline timing for class one instructions, wherein each stage of 
the normal pipeline has a duration equal to the first execution latency. 

6. (previously presented) The processor of claim 4 wherein the pipeline control 
mechanism for class two instructions further comprises: 

an instruction register feedback multiplexer; 

a decode register feedback multiplexer; 

a program counter and program counter update function; 

a hold instruction register signal to control the instruction register feedback 
multiplexer to hold the contents of the instruction register for the second execution latency upon 
detection of a class two instruction; 

a hold decode register signal to control the decode register feedback multiplexer to 
hold the contents of the decode register for the second execution latency upon detection of a class 
two instruction; and 

a hold program counter signal to control the program counter update function to 
hold the contents of the program counter for the second execution latency upon detection of a class 
two instruction. 
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7. (previously presented) The processor of claim 4 wherein the pipeline control 
mechanism for class two instructions further comprises: 

an instruction register gated clock; 
a decode register gated clock; 
a program counter gated clock; 

instruction register clock gating logic responsive to the instruction class indication 
to extend the instruction register gated clock period for the second execution latency upon 
detection of a class two instruction; 

decode register clock gating logic responsive to the instruction class indication to 
extend the decode register gated clock period for the second execution latency upon detection of a 
class two instruction; and 

program counter clock gating logic responsive to the instruction class indication to 
extend the program counter gated clock period for the second execution latency upon detection of 
a class two instruction. 

8. (canceled) 

9. (previously presented) The processor of claim 1 wherein the adaptable execution 
stage further comprises: 

a class one execution unit operable to execute a class one instruction stored in the 
decode register, wherein the class one execution unit has the first execution latency; and 

a class two execution unit operable to execute a class two instruction stored in the 
decode register, wherein the class two execution unit has the second execution latency. 
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10. (currently amended) The processor of claim 1 wherein the adaptable decode stage 
further operates to decode an opcode field to classify an instruction. 

1 1 . (currently amended) The processor of claim 1 wherein the adaptable decode stage 
further operates to decode an opcode field and decode of a data type field to classify an instruction. 

12. (previously presented) The processor of claim 4 wherein the adaptable pipeline 
control unit further comprises: 

a programmable clock gating mode indicator that specifies a normal clock gating 
mode and a slow down clock gating mode; and 

control for extending pipeline stage timing for both class 1 instructions and class 2 
instructions to execute in a longer time period than the second execution latency when the 
programmable clock gating mode indicator specifies a slow down clock gating mode. 

13. (currently amended) A method for processor performance and power optimization 
of an instruction class adaptable pipeline processor supporting at least two classes of instructions 
with a first class operable with a first latency for fetch, decode, and execute pipeline stages of the 
adaptable pipeline and a second class operable with a second latency for fetch, decode, and 
execute pipeline stages and where, the first latency is shorter than the second latency and where the 
instructions operable with the first latency can be specified to operate with the first latency or the 
second latency and where the instructions operable with the second latency can be specified to 
operate with the second latency, the method comprising: 

programming the instruction class adaptable pipeline processor creating an 
application program containing a mix of two classes of instructions to meet functional 
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requirements with a first plurality of instructions used in the program each operable with the first 
latency specified in a format encoding each of the first plurality of instructions as class 1 
instructions and with a second plurality of instructions used in the program each operable with the 
second latency specified in a format encoding each of the second plurality of instructions as class 2 
instructions; 

modifying the application program by changing, where appropriate, the format 
encoding[[s]] of a_class 1 instruction[[s]J to aformat encoding[[s]] that specify specifies the same 
functionality of the original class 1 instruction^]] but providesa class 2 instruction indication[[s]] 
when decoded to minimize power use while still meeting performance requirements of the 
application program; and 

executing the application program on the instruction class adaptable pipeline 

processor. 

14. (previously presented) A method for processor performance and power 
optimization of claim 13 further comprises: 

appropriately programming a programmable clock gating mode to cause a 
specifiable majority of the instructions of the class adaptable pipeline processor to execute at a 
longer latency than the second latency associated with the class 2 instructions, to further minimize 
power use while still meeting performance requirements of the application program. 

1 5. (currently amended) A very long instruction word (VLIW) processor with a 
plurality of instruction class controllable pipelines comprising: 
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a VLIW storage unit holding a diverse plurality of class one and class two 
executable function instructions located in multiple instruction slot format VLIWs, the class one 
instructions each having a first execution pipeline stage latency and the class two instructions each 
having a longer second execution pipeline stage latency; 

an adaptable VLIW fetch pipeline stage for fetching a VLIW from a VLIW storage 
unit to be stored in a VLIW instruction register (VIR); 

an adaptable decode pipeline stage for classifying and decoding the plurality of 
executable function instructions stored in the VIR, and generating an instruction class indication 
for each of the plurality of executable function instructions and storing decoded instructions in a 
plurality of instruction slot specific decode registers; 

a plurality of adaptable execution pipeline stages each operable for execution of the 
decoded instruction stored in an instruction slot specific decode register, the decoded instruction 
being a class one instruction or a class two instruction; and 

an adaptable pipeline control unit responsive to the instruction class indications 
from the classified plurality of executable function instructions for adapting the pipeline stage 
latency of the adaptable VLIW fetch pipeline stage, the adaptable decod e pipeline stage, and the 
plurality of adaptable execution pipeline stages of the plurality of instruction class controllable 
pipelines dependent on the instruction class indications. 

1 6. (currently amended) The VLIW processor of claim 1 5, wherein the adaptable 
VLIW fetch pipeline stage further comprises: 
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a VL1W memory control unit which is operable to begin VLIW processing by 
fetching a VLIW from the VLIW storage unit. 

1 7. (previously presented) The processor of claim 1 5, wherein the executable function 
instructions comprise: 

additions, subtractions, multiplications, divisions, compares, ANDs, ORs, 
ExclusiveORs, NOTs, shifts, rotates, permutes, bit operations, moves, loads, stores, 
communications or combinations thereof; 

1 8. (currently amended) The processor of claim 1 5, wherein the adaptable pipeline 
control unit further comprises: 

a pipeline control mechanism for a VLIW, the VLIW consisting of all class one 
instructions, to control the execution of the VLIW with the first execution pipeline stage latency: 
and 

a pipeline control mechanism for a VLIW, the VLIW consisting of at least one class 
two instruction, to control the execution of the VLIW with the second execution pipeline stage 
latency. 

1 9. (currently amended) The processor of claim 1 8 wherein the pipeline control 
mechanism for a VLIW consisting of all class one instructions further comprises: 

control for normal pipeline timing for the class one instructions , wherein each 
stage of the normal pipeline has a duration equal to the first execution pipeline stage latency. 

20. (canceled) 

21. (canceled) 

9 

PACE 13/28 * RCVD AT 3/4/2009 12:01:12 PM [Eastern Standard Time] * SVR:USPTO-EFXRF-6/44 * DN1S:2738300 « CSID:9198061690 * DURATION <mm-ss):05-48 



9198061690 



11:50:18 a.m. 



03-04-2009 



14/28 



Appl. No. 10/805,803 

Amdt. dated March 4, 2009 

Reply to Office Action of November 4, 2008 

22. (currently amended) The processor of claim 1 5 wherein an adaptable execution 
pipeline stage of the plurality of adaptable execution pipeline stages further comprises: 

a class one execution unit operable to execute a class one instruction stored in the 
decode register, wherein the class one execution unit has the first execution pipeline stage latency; 
and 

a class two execution unit operable to execute a class two instruction stored in the 
decode register, wherein the class two execution unit has the second execution pipeline stage 
latency. 

23-27 (canceled) 

28. (currently amended) The processor of claim 1 wherein the duration of latency of the 
adaptable fetch stage and the duration of latency of the adaptable decode stage are both set equal to 
the duration of latency o f associated with the class of instruction that-is -to be e x e cuting executed in 
the adaptable execution stage. 

29. (previously presented) The processor of claim 1 wherein the adaptable execution 
stage is a variable duration single-cycle execution stage having a first duration of execution equal 
to the first execution latency for executing class one instructions and having a second duration of 
execution equal to the second execution latency for executing class two instructions. 

30. (currently amended) A processor with an instruction class controllable execution 
pipeline comprising: 

an instruction register for holding a class one instruction or a class two instruction, 
the class one instruction belongs to a first group of diverse class one instructions that are each 
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capable of being execute d in a first pipeline, each stage of the first pipeline having m-a first time- 
latency that is less than or equal to a first single-cycle time- clock p eriod, the class two instruction 
belongs to a second group of diverse class two instructions that are each capable of being executed 
in a second pipeline, each stage of the second pipeline having 4 n a second time -latency that is less 
than or equal to a second single-cycle time -clock period, the first single-cycle time- clock period is 
shorter than the second single-cycle time-clock p eriod; 

an adaptable decode pipeline stage for classifying and decoding the instruction 
stored in the instruction register, generating an instruction class indication representing the 
instruction class of the instruction received in the instruction register, and storing a decoded 
instruction in a decode register; 

an adaptable execution pipeline stage responsive to execute a decoded instruction 
stored in the decode register, the adaptable execution pipeline s tage having a first execution logic 
unit for executing instructions of the first group, a second execution logic unit for executing 
instructions of the second group; and 

an adaptable pipeline control unit responsive to the instruction class indication for 
adapting th e tim e period of the adaptable decode pipeline stage and the adaptable execution 
pipeline stage for decoding and executing an instruction in the first single-cycle clock t ime-period 
for class one instructions and in the second time- single-cycle clock period for class two 
instructions dependent on the instruction class indication. 
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